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Abstract 

We present a general approach to the problem of determining tight asymptotic lower bounds 
for generalized central moments of the optimal alignment score of two independent sequences 
of i.i.d. random variables. At first, these are obtained under a main assumption for which 
sufficient conditions are provided. When the main assumption fails, we nevertheless develop a 
“uniform approximation” method leading to asymptotic lower bounds. Our general results are 
then applied to the length of the longest common subsequence of binary strings, in which case 
asymptotic lower bounds are obtained for the moments and the exponential moments of the 
optimal score. As a byproduct, a local upper bound on the rate function associated with the 
length of the longest common subsequences of two binary strings is also obtained. 
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1 Introduction 

Throughout this paper, X n := (Xi, X 2 , ■ ■., X n ) and Y n := (Yi, Y 2 , ..., Y n ) are two random strings, 
usually referred to as the (finite) sequences, so that any random variable X, or Y), i = 1 ,n, 
takes its values in a fixed finite alphabet A. The sequences X„ and Y n are assumed to have the 
same distribution and to be independent. The sample space of X n and Y n will be denoted by X n \ 
clearly X n C A n , but, depending on the model, equality might not hold. 

The problem of measuring the similarity of X n and Y n is central to many areas of applications 
including computational molecular biology (cf. 0, m, m, EH and m) and computational 
linguistics (cf. [33], [23], p3] and [25]). In this paper we adopt the notation of [20], namely, we 
consider a general scoring scheme, where S : Ax A ^ M + is a symmetric pairwise scoring function, 
that assigns a score to each pair of letters from A. Throughout, an alignment is a pair (w, /x), where 
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7r := (7Ti, 7T 2 , • • •, vrfc) and n := (/ii, /i 2 ,..., fik) are two increasing sequences of positive integers such 
that 1 < 7Ti < 7 t 2 < • • • < 7Tfc < n and 1 < fi\ < // 2 < ■ ■ ■ < Hk < n. The integer k is the number of 
aligned letters, and n — A; is the number of gaps in the alignment. Note that our definition of gap 
differs from the one that is commonly used in the sequence alignment literature. More precisely, 
in most of the literature, a gap is a block of consecutive indels (insertion and deletion, formally 
denoted by one or more consecutive in both strings, and it depends on the alignment. For 
example, the left alignment below has four gaps, while the right one has five gaps. 

11312 — — — 3 11312—3 — — 

1-3 — 2111- 1-3-21-11 

Since we consider sequences of equal length, and since we do not have a gap opening penalty (which 
refers to a constant cost to open a gap of any length), our gap corresponds to a pair of indels, one 
on the X-side and the other on the Y-side. In other words, the number of gaps in this sense is the 
number of indels in either one of the sequences. Hence, in our framework, both the left and the 
right alignment above have three gaps. 

Given a symmetric pairwise scoring function M + and a constant gap price 5 £ R, 

the score of the alignment (7r, /x), when aligning X n and Y n , is defined by 

k 

(X„; Y n ) := Yi S(X ni , Y w ) + 5(n - k ). 

i— 1 

In our general scoring scheme 5 can be positive, although usually S < 0 penalizes matches with . 
For negative 5, the quantity —5 is usually called the gap penalty. The optimal alignment score of 
X n and Y n is then defined as 


Tn — T n (X n , Y n ) .— max I7(_ m(X n , Y n ), 

O./d 

where the maximum is taken over all possible alignments. To simplify notations, in what follows, 
we often set Z n := (X,„ Y n ) so that L n = L n {Z n ). When 5 = 0 and the scoring function assigns 
the value one to every pair of common letters and zero to all other pairs, i.e., 


5(o,6) 


1, if a = 6; 
0, if a / b, 


then L n (Z n ) is just the maximal number of pairs of aligned letters - the length of the longest 
common subsequences (LCS) of X n and Y n , which is probably the most frequently used measure 
of global similarity. 


1.1 A Summary of Known Results 

The study of L n is not only of theoretical interest but also has some practical consequences. For 
example, it is useful for distinguishing related (homologous) pairs of strings from unrelated ones. 
Unfortunately, for fixed n and an arbitrary distribution for X n and Y n , the distribution of L n 
is unknown. In view of these, an alternative approach is to study the typical values and the 
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fluctuations of L n . When X n and Y n are taken from an ergodic process, by Kingman’s subadditive 
ergodic theorem, there exists a constant 7 * such that 

— —>■ 7 * a. s. and in L\, as n —>■ 00 . (1.1) 

n 

In the LCS case, the existence of 7 * was first shown by Chvatal and Sankoff [Sj, but its exact value 
(or an identity for it) remains unknown even for independent i.i.d. Bernoulli sequences (although 
numerically estimated). Alexander [T] obtained the rate of convergence in ( 11 . 11 ) in the LCS case, 
which was extended to general scoring functions in [ 20 ], and to multiple (two or more) independent 

i.i.d. sequences with general score function in [12j . 

Since the exact distribution of L n is rather hard to determine even for moderate n, it is natural 
to look for a limiting theorem, e.g., 


L n — IE(L n ) 
n a 



n —>• 00 , 


for some a € (0,1). Here V is a limiting distribution, and —> stands for convergence in law. 
Typically, one expects a = 1/2, and V to be a centered normal distribution, i.e., 


L n ~ IE (L n ) d r 


n —>• 00 , 


( 1 . 2 ) 


where AT stands for a centered normal distribution with variance a 2 > 0 . Under G2D, for any 
r > 0, the r-th absolute moment of L n would then be expected to grow at speed n r//2 , as n —» 00 . 
In particular, the variance would grow linearly, i.e., 


lim 

n—too 


Var (L n ) 
n 


= a 2 > 0 , 


and then, (II. 2 p would be equivalent to 


L n E (L n ) 0 . <-/„ -1 \ . /. 

7 = fT . —> A (0,1), n —>• 00 . (1.3) 

y Var(L n ) 

Such a limiting theorem would allow to construct asymptotic tests, based on the optimal score L n , 
hypothesizing that two given sequences X n and Y n are independent (not homologous) or not. 

Note that in the gapless case, i.e., when 5 = — 00 , the optimal score is just the sum of the 
pairwise scores L n = Y^i =1 S'(Xj,Kj), and thus, under rather general assumptions on X n and Y n , 
the limiting theorem (11.31) (equivalently, (11.21) ) holds true. This observation suggests to conjecture 
that (11.31) also holds in more general cases, in particular, in the LCS case. No such type of limiting 
theorem was known until HE which proved (11.31) in the LCS case, when X n and Y n are independent 

i.i.d. random sequences such that Var(L n ) admits a sublinear (in n) lower bound. The result in [15J 
upper-bounds the Monge-Kantorovich-Wasserstein distance, and in turn the Kolmogorov distance, 
implying weak convergence. The limiting theorem in na was extended to multiple independent 

i.i.d. sequences with a general score function in [12]. In contrast to na, the result in [ 12 ] directly 
upper-bounds the Kolmogorov distance, which improves the rate of weak convergence, but still 
requires a (looser) sublinear variance lower bound. Hence, establishing a linear-order variance lower 
bound for a general distribution of X\ implies immediately the corresponding limiting theorem (11.31) . 
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Therefore, there is a direct connection between a variance lower bound and the limiting theorem 
», leading to an extra motivation to study the rate of convergence of Var(L n ) as well as all the 
other central moments of L n . 

The study of the asymptotic order of Var(L n ) in the case of LCS was first proposed by Chvatal 
and Sankoff [8], who conjectured that Var(L n ) = o(n 2 / 3 ), for X n and Y n independent i.i.d. sym¬ 
metric Bernoulli sequences, based on Monte-Carlo simulations. By the Efron-Stein inequality, in 
the case of independent i.i.d. sequences with score functions satisfying (12.111 below, there exist a 
universal constant (independent of n) C 2 > 0, such that 

Var(L n ) < C^n, for all n € N. (1.4) 

(see Section [2] for more details). For the LCS case, this result was proved by Steele [29]. In [51] , 
Waterman asked whether or not the linear bound on the variance can be improved, at least in the 
LCS case. His simulations showed that, in some special cases (including the LCS case), Var(L n ) 
should grow linearly in to. These simulations suggest a linear lower bound on Var(L n ), which would 
invalidate the conjecture of Chvatal and Sankoff. In the past ten years, the asymptotic behavior of 
Var(L n ) has been investigated under various choices of sequences X n and Y n (cf. [5], [U], [IT], [T9] . 
[21], [2j> [3], etc). In particular, in [21], the asymptotic behavior of Var(L n ) is investigated within 
two scenarios. In the first one, a general scoring function with a high gap penalty is considered, 
and X„ and Y n are independent i.i.d sequences drawn from a finite alphabet (see also Theorem 
16.21 and Remark 16.31 below). In the second one, the LCS case is studied when both X n and Y n 
are binary sequence having a multinomial block structure, which is a generalization of the model 
in [30j (the first analysis of the so-called high entropy case). In many ways, the present paper 
complements [21] . The results of [T9] (the so-called low entropy case) were generalized in m , and a 
linear variance lower bound was proved for an arbitrary finite alphabet (not just binary) and for the 
central moments of arbitrary order (not just the variance), but still under a strongly asymmetric 
distribution over the letters. 

The goal of the present paper is to study in this general framework the lower bounds on general¬ 
ized moments of the optimal alignment score L n . To start with, in the next section we quickly show 
how to obtain upper moments and exponential bounds on L n . In Section [3[ we develop a general 
argument to find lower bounds for the generalized central moments of L n . These lower bounds are 
obtained under a main assumption, and sufficient conditions for the validity of this main assump¬ 
tion are provided. Next, we relax some of these conditions (so that the main assumption is no 
longer satisfied), and develop a “uniform approximation” method giving rougher lower bounds for 
the generalized central moments of L n . In Section [H the general results are applied to the binary 
LCS case, and some refinements of those lower bounds are also presented. Finally, in Section [5[ we 
obtain a lower bound on some moment generating function, which is then used to derive a (local) 
upper bound on the rate function associated with the binary LCS case. 

2 Upper Bounds on Moments and Exponential Moments 

In this short section, we briefly review the upper bounds on central moments and exponential 
moments of L n . Recall that for any constant gap price 5 € R, changing the value of one of the 2 n 
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random variables Xi ,..., X n and Y\,... ,Y n changes the value of L n by at most K, where 

K : = max \S(u,v) — S(u,w)\. 

U.V.W^A 


( 2 . 1 ) 

( 2 . 2 ) 


Thus, by Hoeffding’s exponential martingale inequality, for every t > 0, 

P(|L„-E(L n )| >t) <2e- t2 ^ nK2 \ 
which indicates that V n := (L n — E (L n ))/y/n is subgaussian. Hence, for any r > 0, 
E(|14D<r2 r / 2 (9A^ 2 ) r/2 r(0 =r(18) r / 2 r(0 K r . 

The above inequality provides an upper estimate, of the correct order, on the central absolute 
moments of L n . Some further simple and direct computations allow to improve the constant. Let 
r > 2, x > 0, and let V n := \L n — E(L n )|. From (12.21) . 

/*00 /‘OO 

E (v^ = J P (y n > t i/r j dt < x + 2 J e -t 2/r /ini< 2 ) dt 
Minimizing in x, i.e., taking x = (/l 2 (ln2)n)^ 2 , and changing variables u = t 2 ^ r /(K 2 n), lead to 


E(V£ <K r 


- U u r l 2 - X du 


n r/2 =: C(r) n r/2 . 


r/2 


f c 

(In 2 ) r/2 + r / 

J In 2 

When x = 0, the corresponding constant is slightly bigger than C(r), and is given by 

D(r) := rK r e~ u u rl2 ~ x du = rK r T . 

Remark 2.1. For the LCS case, the following constant was obtained, as a consequence of a general 
tensorization inequality, in [16] : 

E(r) := (r - {X 1 / Yi). 

For r close to 2, E{r) is smaller than C{r). Indeed, 

E(2) = P (X\ / Y\) < 1, C{ 2) = l + ln2, D{ 2) = 2. 

However, for r > 2 large, A(r) might be bigger, e.g., 

A(4) = 162 P (X 1 / Hi), D( 4) = 4T(2) = 4. 

In addition to moment estimates, the above methodology also provides an upper bound on the 
moment generating function of |V^| = V n /y/n = \L n — E(L n )|/y / n. Let t > 0 and r > 1. Then, with 
K = 1 and using (12. 2 j) . 


P (VlW > r'j = p (^v n > 


^Jn\nr 


<2e-^rY A 2 . 


Thus, 


/ oo /*oo / 2 \ 

e -( lnr ) / f2 dr = 1 + 2 J exp 

< 1 + ^1 + erf ^ e* 2 / 4 < 1 + 2(2.3) 


where erf(-) denotes the Gaussian error function. 
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Remark 2.2. The upper bound on the moment generating function can clearly also be found 
through the upper bounds on the moments. Indeed, when K = 1 and with the bound 

E(|I4D < D(r) = rT Q = 2T = 2^E (|e| r+1 ) , r G N, 

where £ is a AA(0,1/2) random variable, it follows that 


= 1 + ^E(|y n | r )- < 1 + 2^E E 


r=1 


r —1 


r+1 


2V? 


H“ 1+ t 


OO 

E fcE ( |e| ' 


k=2 


fc!' 


Since, 


i 00 i i °o 

E (|«| e «l) = E(|5!) + 7 X>E (|*|‘) = + f=E (|*| 

fc=2 ' V k=2 


kV 


while, 


E(j£|e t|?l ) = = 


d 

dt 


1 + er/mV 2 / 4 


t 


= o ( 1 + er /( n ) ) ef2/4 + 


7T 


the upper bound in (|2.3|) is recovered: 

E < 1 + 2^ (E (|£| e t|?l ) - E (]£])) 



e t2 /\ 


3 Lower Bounds on Generalized Moments 


Let <L : M + —> M + be a convex, nondecreasing function. The main objective of this section is to find 
a lower bound for the expectation 

E ($ (|L n (X n ; Y n ) - E (L n (X n ; Y n ))|)) , (3.1) 


where, again 


En — T n (X n , Y n ) — max „)(X n , Y n ), 

(^.m) 

and where again the maximum is taken over all possible alignments. 

Remark 3.1. In what follows, we deal with lower bounds on (13.11) with no further restrictions on <L. 
This is somehow different from the upper bound case, where to the best of our knowledge, there is 
no uniform approach that applies to every convex nondecreasing <L. Indeed, in the previous section, 
we established upper bounds for <L(x) = |x| r , r > 0, and 4>(x) = e tx , t > 0. As shown in [16], Efron- 
Stein type inequalities can be applied to <I>(x) = |x| r , r > 0, via the Burkholder’s inequality, giving 
the aforementioned constant E(r). Another generalization of the Efron-Stein inequality, given, for 
example, in [ 6 ] Chapter 14], is the subadditivity inequality of the so called 4>-Entropy. However, it 
only applies to those functions having strictly positive second derivative and such that l/<&" is 
concave, e.g., 4>(x) = |x| r , for r G (1,2], or <L(x) = xlogx. Moreover, the upper bounds on (13.11) for 
those are linear in n as easily seen via, e.g., [ 6 ] Theorem 14.6], together with (12.11) . In [ 6 ] Chapter 
15], some further generalizations of Efron-stein inequality for 4>(x) = |x| r , r > 2, are also obtained, 
which, together with ( 12 . 11 ) . also imply upper bounds of order n r / 2 on the r-th central moments (cf. 
Theorem 15.5 therein). 
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3.1 The Random Variable U n and the Set U n 

Recall that X n is the sample space of X n and Y n , so that X n x X n is the sample space of Z n = 
(X n ;Y n ). In the sequel, we consider a function u : X n x X n —>• Z, so that U n := u(Z n ) is an 
integer-valued random variable. Let S n be the support of U n , and for every u G S n , let 

l n {u) : = E (L n {Z n ) | U n = u), 

and set fi n := E(L n (Z n )). Since x i->- <h(|x — g n \) is convex, 

E ($ (|L n (Z n ) - /i n |) I u n )>$ (|E (L„(Z n )| Un) - /in|) = $ (\£ n (U n ) - /ini) , 

so that 

E ($ (|L n (Z n ) - /ini)) > E ($ (|4(Rn) ~ Mn|)) ■ 

Let 5 > 0, and let the set U n C 5 n be such that 

l n [u 2 ) — ^n(^i) > 5, for any tii, 1*2 £ kin with tti < ii 2 - (3-2) 

Since 5 n is finite, so is Z7 n . Therefore, set 

U n := {iti,..., ii m } and k 0 : = max (u i+ i - uf) , (3.3) 

i= 1, ••• ,m—1 

where u\ < U 2 < ■ ■ ■ < u m and where m = m(n) depends on n. Clearly, except perhaps in some 
very special cases, such a set U n formally always exists (it might even consists of two elements). As 
we will see, U n becomes useful if P (U n GlA n ) > Co, for 5 and Co independent of n. 

Lemma 3.2. Under (13.2p . for every Ui,Uj G U n , 

£ 

l^n(iij) (iij ) | ^ 7 'U'j I ' 

«0 

Therefore, there exists a n G [iii,ii m ], such that for every Ui G U n , 

£ 

l^n(iij) /in| ^ 7 l^i ®n| • (3.4) 

k 0 

Proof. Let g : [ui,u m \ —> M + be a differentiable function, such that g{uf) = £ n (ui ), for every 
i = 1,... ,m, and such that g'(x) > 6/ko, for every x G (ui,u m ). Then, one of the following three 
mutually exclusive possibilities holds: 

• /in ^ £-n (ii 1) : 

• fn(iil) /in (L fn(iim)j 

• fn(iim) ^ /in' 

In the first case, for every m G U n , i = 1,..., m, 

|Ci(iii) /in| — fn(iij) Mn — l) — 7 (^i iil) j 

ko 
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so (13.411 holds with a n = u\. The third case can be handled similarly with a n = u m . For the middle 
case, since g is increasing and continuous, there exists a n € [ui,u m \ such that g n = g(a n ). Hence, 
by the mean value theorem, 


\^n{ u i) l^n | — | giV'i) £((®n)| ^ , I > 

ko 

completing the proof. 

Now, in view of (13. 411 . since <I> is monotone and non-negative, 


□ 


E($(|4(£/n)-Ain|))> $(K{u i )-lin\)W(U n =U i )> ^ $ (y K - a n \\ P (U n = uf) , 


Ui Mi £ lAn 

i.e., the lower bound on the generalized moment is of the form 


E(<F(|L„(Z n ) - /x n |)) > E($(|4(H„) - n n \)) > X] $ (^\ui - a n \JF(U n = Ui) (3.5) 

«iG Un ^ 0 ' 

U n € U n ) P (U n € U n ). (3.6) 


= E( d>( — \U n -a n \ 


When <3?(x) = |x| r , for some r > 1, then (13.61) becomes 


E (\L n (Z n ) - g n \ r ) > ( — 1 E (|[/ n — a n | r | € U n ) P (U n € U n ) 

If P (U n € U n ) > Co > 0, with 5, ko and co all independent of n, then 

<5 


E ( \L n (Z n ) — g n \ r ) > 


c 0 E ( \U n - a n \ r | U n € U n ) > 


. c 0 minE (\U n — a\ r | U n G U r ) . 
Kq J aSK 


Thus a lower bound on the centered absolute rth moment of L n {Z n ) is obtained via conditioning 
on U n . Typically, in applications, the random variable U n counts the number of some particular 
letters in Z n , and, therefore, it has a binomial distribution. In this case, the right hand side of (|3.6I) 
is relatively easy to compute. 


Remark 3.3. Note that the argument leading to (13.611 is independent of n. Therefore, (13.511 
continues to hold whenever depends on n, provided <I> n remains convex and non-decreasing. 


3.2 On the Existence of U n and U n 

As shown in the previous subsection, the lower bound on the <h-moment can be obtained via the 
random variable U n: provided that there exists a universal constant 5 > 0 such that the correspond¬ 
ing set U n has a large enough probability. In what follows, we show that the existence of a suitable 
U n can be guaranteed using a random transformation , 

TZ : H x X n x X n . —>• X n x A„, 

such that, for most of the outcomes of Z , TZ increases the score by at least some fixed amount 
Co > 0. Here, with a slight abuse of notation, H is a sample space for the randomness involved in 



1Z. More precisely, the random transformation should be such that there exists a set B n C X n x X n 
having a relatively large probability so that, for every z n € B n , the expected score of TZ(z n ) exceeds 
the score of z n by £q, namely, 


K(L n (TZ(z n ))) > L n ( z n ) +e 0 . 

Above E denotes the expectation over the randomness involved in 1Z (i.e., the integral over fi). The 
requirement that the set B n has a relatively large probability is formalized by requiring that 

A n (£o) := P (E ( L n (7Z(Zi n )) — L n ( Z n ) | Z n ) < £o) y 0, as n — > oo. (3-7) 

We will see that the faster A n (eo) tends to zero, the better the lower bound is. 

The random transformation should be associated with U n and the set U n = {ui,..., u m } so that 
when U n takes its values in U n . then 1Z increases its value in U n and has no effect on the conditional 
distribution of Z n . To formalize this requirement, for every u € U n , let p(“) be the law of Z n given 
U n = u, i.e., 

PW(A)=P(Z„G A\U n = u), A cX n xX n . 

Now, for every Ui, itj+i € U n , the following implication should hold: if Z„ ' is a random vector with 
law p(“^, then 

n (z£*°) (3.8) 

which means that for every z n € X n X X n , 

P(7Z(Z n ) = z n I U n = Ui) = P(z n = z n I U n = u i+ 1 ). 

In particular, (13.81) implies that if u(Z„) = Ui , then u(77(Z n )) = u i+ 1 , but the converse statement 
might not be true. 

Fast Convergence Condition. Our first general lower bound theorem assumes the existence of 
£o so that A n (eo) converges to zero sufficiently fast. To simplify the notation, in what follows, let 
Z n := TZ(Z n ). The following theorem is a generalization of [21], Theorem 2.2], 

Theorem 3.4. Forn € N, let there exist a random variable U n = u(Z n ), a setU n = {ui,..., it m ( n )}, 
a random transformation 1Z satisfying (|3.8I) . such that the following two assumptions hold: 

(i) There exists a universal constant A > 0 such that 

Ln(fZ*n) ~ L n ( Z n ) > — A. (3.9) 

(ii) There exists a universal constant £o > 0 such that 

A n (£o) = o(tp(n)), oo, (3.10) 

where A n (eo) is given by (13.71) . and where ip be a function on N such that 

P (U n = u) > ip(n), for all u € U n . (3-11) 
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Then, there exists a n € [ui,u m ] so that for n large enough, 


E($(|L n (Z n )-/i n |))> Y, ® 

Ui^HAn 



P {U n = Ui )> y ® 

UiEVfn 



<p(n), 


(3.12) 


where 5 = Eq /2 and ko(n) = maxj = i — Uj). 

Proof. Let U{ E U n , and let Z.„ ,,, ' ) be a random vector with law By (13.81) . 

£ n (u i+ i) = E (L„(z^ i} )) . 

Hence, 


E LjZ^n-Ljz^) ZH 


4(^ l+ l) - 4W = E (in(z' Ui) )) - E (intz^ 5 )) =E 
Let £o be as in (ii), and let L> n (eo) C X n x X n be the set of outcomes of Z n such that 
{e ^L n (Z n ) — L n (Z n ) Z n ) > £q| = {Z n € H n (eo)} ■ 


(3.13) 


By (13.91) . for any pair of sequences z n , when applying the transformation TZ , the score can decrease 
by at most —A. Hence 


E 


E (L n ( z£*)) - L n (z&*>) | zW)] > e 0 P (z^ } € 5„(e 0 )) - ^4P t B n {e 0 )) . 


By definition, P(Z n 0 B n {e o)) = A n (eo)- Therefore, by (13.101) and (13.111) . 

P (z&*> 0 S n (e 0 )) = P(Z„ 0 B n (£ 0 ) | U n = m) < p ,f. n(£0) < = o(l), n oo. 

V / P (U n = Ui) ip(n) 

Choose no large enough (depending on eq and A) so that for any n > no, 


eo 1- 


A w (£q) 

(f{n) 


Therefore, for any n > no and any Ui &U n , 


. A„(eo) > £o 
<p(n) ~ 2 ' 


£o 


Ai(^i+l) Ai(^i) A 2 —' 


This proves (13.2j) . which entails (13.4j) and thus (13.121) . The proof is now complete. 


□ 


Remark 3.5. The assumption (13.91) typically holds for any meaningful function u. The difficulties 
in applying Theorem 13.41 lie in finding u, TZ and U n , such that (13.101) holds for a given sq. Since 
typically for every u E S n , P (U n = u) —>■ 0, A n (eo) has to converge to zero sufficiently fast as 
n —>• oo. The larger the probability P(£/ n € U n ) is (recall that it has to be bounded away from zero), 
the smaller ip(n) is, and the faster the convergence to zero of A n (eo) is. 
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3.3 Uniform Approximation 

As mentioned in Remark 13.51 the most important assumption in Theorem 13.41 is the existence of 
the random transformation TZ and of the corresponding random variable U n such that (for some 
£o > 0) the probability A n (eo) converges to zero sufficiently fast. For all purposes, this is often a 
very specific requirement and so far, the existence of suitable 1Z and U n has only been shown for 
very specific models (cf. and [TBJ). It is therefore important to relax the assumption of 

fast convergence, which is the goal of the present subsection. Below, the condition (13. 101) is replaced 
by an arbitrarily slowly convergent sequence A n (eo) —>• 0. 

Theorem 3.6. For every n € N, let there exist a random variable U n = u(Z n ), a set U n = 
{u±, ■ ■ ■, Um(n)} C S n , a random transformation TZ, a positive function on N, and a constant 
A > 0, such that (13.81) . (13.91) and (13.111) are satisfied. Moreover, let there exist a universal constant 
c > 0, such that 


m{n ) > cip (n), for all n € N. 
If for an > 0, A n (eo) —> 0, then for n large enough, 


(3.14) 


fv 3 1 («) 


E (• (IMZ„) - *1)) > £ *(f(g^j+i)) *») + | * (i^j) a \ * . 

Proof. Let eo be as in the assumption of the theorem and let the set Uff C S n be defined as follows: 


uEU® <(=)> P (Z n ^L B n (e o)| U n — u) < y/ A n (eo), 

where B n (e q) is given by (13.131) . Thus, by the very definition of A n (eo), 


(3.15) 


A n (eo) — P (Z n 0 B n (e o)) > P (Z n 0 B n (e o) | U n — u) P (U n — u) > y/ A n (eo) P ( U n 0 , 

which implies that 

P (U n 0Z7°) < y/ A n (e 0 ). (3.16) 

When Ui € U n using an argument as in the proof of Theorem 13.41 and by (13.91) and (13.151) . 

£ n (ui+ 1) - In(Ui) > e 0 (l - \M«( e o)) ~ Ay/ A ra (e 0 ) > y, (3.17) 

provided that n > n\, for some positive integer m large enough. When m 0 U n D U { f , by (13.91) . 

I n {ui+ 1) - 4(«i) > -A. (3.18) 

Assuming n > n\, we are interested in the set U n PiU® which can be represented as the union of 
disjoint subintervals of U n = {u\, ■ ■ ■ , u m }, i.e., U n D= Li/ =1 Ij, where for j = 1,..., N, 


Ij = 


| u/\ ..., , with < • • • < u// < < ■■■ < u// < ■ ■ ■ < uf f < • • • < 


r N ’ 
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are disjoint subintervals of U n . Clearly, the number N of such intervals as well as the intervals 
I\, .... Ix themselves depend on n. On each interval Ij, (13.1711 implies that the function 4 increases 
with a slope of at least eo/2, and moreover, note that if Ui = u^\ then i n (ui + 1 ) 0 Ij. Note also 
that the approach in proving Theorem 13.41 largely applied because U n was a single interval where 
I n increases. In the present, case the set U n n consists of several intervals and between them 
the function t n does not necessarily increase. At first, it might appear that the approach of the 
previous subsection would work, when replacing 


E $ (6 \ui - a n \)¥(U n = 

U , (rl-in 


with 


N 

E $ (5 |Ui - a n \) P {U n = Ui) = ^2 X] ^ \ Ui - a «l) F ( U n = u i) ■ 

J—1 Ui£lj 

Unfortunately, a n would now depend on Ij, and we would have to deal with the sums 

N 


i =i uteij 


Uj — a. 


U) 


P {U n = Ui) . 


To bypass this problem, we proceed differently. Consider the sets 

Jj ■= {4 } J > • • • 1 4 (4j } ) } , 3 = 1, ■ • •, N , 

i.e., Jj is the image of Ij under I n . By (13.171) . all elements of Jj are at least eo/2-apart from each 
other (the intervals Jj might overlap, although the intervals Ij do not). By (13.181) . 

^ (4(«i+i) - 4 (ui)) > -A • Card (U n \Un) =: -Am 0 (n), 

Ui£U n \U° 


implying that the sum of the lengths of the (integer) intervals Jj differs from the length of J : = 
U jLiJj by at most Arrio(n). Formally, 

N 

y (4(4^) -4(y ) )) - \ J\ < Am 0 (n), 

3 = 1 

where | J\ denotes the length of J, i.e., the difference between the largest element and the smallest 
element of J, and I n {ur}) — 44i^) is the length of each Jj. The number of elements eo/2-apart 
needed for covering a (real) interval with length Amo(n) is at most 2Amo(n)/eo + 1. Hence, at 
most 2 A mo (n)/eo + 1 elements eo/2-apart will be lost due to overlappings, which implies that the 
set J contains at least 

\U n \- 2Ama{n] - 1 = m(n) - 2Am ° (n) - 1 

£0 £0 

elements which are (at least) eo/2-apart from each other. Using (13.111) and (I3.16|) . 

V A n (e 0 ) > P (U n G U n \ Un) = y P (U n = u) >m 0 (n)(p(n), 

U£Un\U° 
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and therefore, 


m 0 (n) < \J A n (eo) V 1 (^)- 

Recalling the assumption (|3.14l) . there exists ri 2 € N such that for all n > ri 2 , 


m(n ) — 
Now let 


2dmo(n) 

£o 


— 1 > m(n ) — 


2A\J A n (eo) > ce o — 2A\J A n (eo) ^ c 


£ 0 <p(n) 


£ 0 <p(n) 


> ^ 


-i, 


n) 


(3.19) 


B n ■— % u € lA n . |^ n ('it) fi n | ^ 


£ 0 C 


= Bt u B: 


where 


B n ■— % u S lA n . (, n {u) fi n ^ 


£ 0 C 


16 (p(n) 


16 ip(n) 

and B~ := < u € U n : l n (u) - fi n < ~ 


£ 0 C 


16c p(n) 


Since the interval 




6 0 C 




£ 0 C 


16y?(n) ’ 16y>(n) 

contains at most cc^ _1 (n)/4 elements which are eo/2-apart from each other, and since the set J 
contains at least cc^ _1 (n)/2 elements which are eo/2-apart from each other, it follows that B n 
contains at least c<£> _1 (n)/4 elements (the values of l n being eo/2-apart from each other). This 
implies that at least one of the two sets B+ and B~, say B+, contains at least c</? -1 (n)/8 elements 
whose ^-images are at least eo/2-apart from each other. Let B n be the union of those cc/? _1 (n)/8 
elements in £>+. Then the set B n \ B n consists of at least cc/? _1 (n)/8 elements (whose ^ n -images are 
at least eo/2-apart from each other). Hence, by the monotonicity of T, 

m(n) m(n) 

E $ (I f-niUi) ~ fl n |) P (U n = Ui) > ^ 4 > (| £ n (Ui) ~ Hn\) <p(n) > ^ ^ ^ n (“) ~ Bn\) ^( n ) 


2=1 


2=1 


uefin 


> 


^2 $ (\e n (u) - Hn\) <p(n) + $ (Kn(«) -Hn\)<p(n) 


> 


> 


u£B„ 

-sV-Hn) 

E 

3 =1 V 

%V~\n) 


3 = 1 


$1 ££ 


8 ip(n) 


8 <p(n) 


UGB n \B n 


+3 <p{n) + T 


EqC 


+ J <p(n) + $ 


16c p(n) J 8 


e 0 c 


<P (n)<p(ri) 


(l6v?(n) J 8 


gQC 

4 \ v 16(/?(n) 


□ 


The inequality (13.51) now finishes the proof. 

Remark 3.7. In both Theorem 13.41 and Theorem 13.61 the thresholds, on large n, are independent 
of the choice of the convex function 4>. In Theorem 13.41 the threshold no depends on the sequence 
A n (eo)/ ( p( n ) j but not on 4>. In Theorem 13.61 the threshold on n is obtained in two steps: the first 
threshold ni is to ensure the validity of (13.171) . while the second threshold ri 2 is to guarantee the 
validity of the last inequality in (13.191) . Both steps are independent of the choice of 4>. 
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Remark 3.8. The assumption m(n) = \U n \ > cy> _1 (ra) (for all n £ N), together with (13.111) . 
implies that for every u £ U n , F(U n = u) > </?(n) > cra _1 (n). Hence, U n cannot be too large, 
since all the atoms in lA n have roughly equal masses, justifying our naming this approach “uniform 
approximation”. On the other hand, these two assumptions imply that P (U £ U n ) > c, so that U n 
cannot be too small either. In the next section, we will see that the set lA n satisfying the requirements 
of Theorem 13.61 does exist, but its choice must be made more carefully when compared to the one 
in Theorem 13.41 This is the price to pay for the arbitrarily slow convergence of A n (eo)- 


4 The Binary LCS Case 

As an applications of the general results of the previous section, in the rest of the paper, we consider 
the binary alphabet A = {0,1}, i.e., X„ and Y n are two independent i.i.d. sequences such that 
P(Xi = 1) = P(Yi = 1) = p, p £ (0,1), and L n (X„; Y n ) is the length of the LCS of X n and Y n . Let 
u( z n ) count the zeros in z n = (x„;y n ). Thus U n is equal to the number of zeros in Z n = (X n ; Y n ) 
and therefore U n ~ B(2n, q ), where q = 1 — p, and its support is S n = {0,1,..,, 2n}. 

Let the random transformation 1Z be defined as follows: given z n , choose randomly (with uniform 
probability) a one and turn it into a zero. With this transformation, the condition (13.81) holds for 
any u = 0,..., 2n — 1. To see this, for any u = 0,..., 2n, let A n {u) C {0, l} 2n consist of all the 
binary sequences containing exactly u zeros. For any z n £ A n (u), 


Z77, — Z n I U'n — u] — 


P (Z n = z n , U n = u) P (Z„ = z r 


q u {l-q) 


2 n—u 


P (U n = u) 


P (U n = u) 


2 n 


q u { 1 - q) 


2 n—u 


U 
2 n 


-1 


In other words, P( u ) is the uniform distribution on A n (u). Now, for any such u, let Z^ be a random 
vector with law p( u \ then Z^ = 1Z(Z^) is supported on A n {u + 1). For any z n € A n (u + 1), let 
0 < i\ < ■ ■ ■ < i u +1 < 2 n be the positions of the zeros in z n , and let z„ , j = 1,..., u + 1, be the 
sequence in A n {u) obtained by changing the zero of z n at the position ij to a one. Then, 


u -\-1 

P(ZW = Z„)=^P(Z 
3 = 1 


(«) — z 


y{u) _Ah) 


y (u) 


Zn = 


U+ 1 


2 n—u \2n 


u 


-1 


u +1 
2 n 


= P(“ +I )(zn). 


Finally note that since we are considering the LCS, any single-entry change in z n changes the score 
by at most one. Hence, A = 1 in (13.91) and 


— 1 < £n(u + 1) — l n {u) < 1, for any u £ {0,1,... , 2n — 1}. 


(4.1) 


4.1 Lower Bounds Under Fast Convergence 

The following theorem is proved in [19 , Theorem 2.2] (see also [16 , Theorem 2.1] for a quantitative 
version with an arbitrary finite alphabet size). 


Theorem 4.1. There exist positive constants s 1 and £2 with £1 > £ 2 , and a set B n C A n x A n such 
that for every z n £ B n , 


L n (Z n ) — L n (Z n ) — 1 Z n — z n j > £ 1 , P ( L n (Z n ) — L n (Z n ) — —1 


Zn — Z n ) ^ £2- 
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Moreover, there exists po > 0, such that for every 0 < p < po, 

F(Z n eB n )>l-e- Cin , 

where c\ > 0 does not depend on n, but may depend on p. 

Hence (13.711 holds with eq = £i — £2 and A n (eo) < e~ cin , for n large enough. Furthermore, as 
shown above, (13.811 also holds for m = 0,1,... , 2n — 1. 

In what follows, we will apply the fast convergence result (Theorem 13.4p to two possible choices 
of li n , and then compare the corresponding lower bounds on moments and exponential moments of 
| L n (Z n ) p n |. 


4.1.1 The Standard li n 

Often (cf. [T9] , [21] and f ll) } the following choice of U n is used: 


U n 


2 nq — y/2 to, 2 nq + y/2 to 


n S n . 


This “standard U n ” is such that k$ = 1 (recall (13.31) 1. By the de Moivre-Laplace local limit theorem 
(cf. [271 PP- 56]), there exists a universal constant b = b{p ) > 0, which is independent of n, but 
depends on p, such that for n large enough, 


P(H n 



for any u G U n . 


(4.2) 


The (best) constant b depends on the choice of U n , and it is easy to see (see (14.1011 below) that 
for the standard U n as above, b > e 2 y / T. Without loss of generality, assume that 2 nq is a positive 
integer (if not, replace 2 nq in the definition oilA n with [2 nq]). Hence, with <p(n) = l/by/n, the 
condition (13.1011 is verified and thus, all the assumptions of Theorem 13.41 are satisfied. Therefore, 
recalling that 6 = Eq/2, E(4>(|L n (Z n ) — p n \)) is lower-bounded by 


Yi <S>(6\ui-a n \)F(U n = Ui)> Y 

Ui ElAji Ui r Un 


1 


1 


Ui Oy 


> 


1 


b\Jn 


y/2 n 

Y *wi) 


by/n by/n 
1 


2qn+\/2n 

k=2qn—\/2n 
yj2n 


by/n 


^0) + 2Y^f^f) ) , (4-3) 


j=-y/2n 

where the last inequality is obtained by minimizing over a n and since 2 nq is a positive integer. 
Example 4.2. For 4>(x) = \x\ r , r > 1, then (14.31) becomes 


2 r ~ 1 by/n 4-, 
J= i 


y/2n 

T,f> 


_r /*\/2n 

io_/ 

-'bV^Jo 


x r dx = 


e r 0 {2 n)( r+1 )/ 2 2( 3 " r )/ 2 er 


(r + l)2 r ~ 1 by/n b(r + 1) 


° n r /\ 


Hence, for n large enough (independent of r, see Remark 13.71) . 

o(3—r)/2_r 

E(\L n (Z n ) - p n \ r ) >di(r)n r/2 , where d/r) = di(r,p,e 0 ) := ——— 

b (r + 1) 


(4.4) 
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Example 4.3. For <3?(:r) = e tx , t > 0, then the second summation in (|4.3I) becomes 


y/2n 

E* 

3 = 1 

Hence, for n large enough, 


gpj 

2 


y/2n 

E' 

i=i 


= E e ‘ C0J/2 = E 4 


y/2n 

E. 

i=i 


a - p^ +1 

i -ft 


where pt := e £ ° t//2 . 


rr: lwz„,- u .l,W 1 / n 2 ft -2p^ \ ft(2p,' /S -ft 1 -l) 

V / 1-pt y 6(p t -l)v/n ^(l-^ 1 )^ 


(4.5) 


Now, for any t > 0, there exists n large enough (depending on t) such that 

E ( gH n (^ n ) — MnlA > ^Pt \pn 

V ) ~ b (pt - 1) n 

Taking t = s/y/n, with s > 0, gives a lower bound on the moment generating function of \V n 
\L n (Z n ) - iJ, n \/y/n. Moreover, by ([4~5l) . 


Since 


lirninf E (e s ^ Vn ^'] > liminf —\ - 11 

n^oo V / n—xx> b(l — p t ) y/n 



lim — (e £ ° s/V2 - 1 
stO £qS ' 


1 

72’ 


(4.6) 


the right-hand side of (14.61) approaches 2y/2/b as s 0. Since b > y/ne 2 , then 2y/2 < b, showing the 
deficiency of the obtained bound, since we naturally expect the right hand side of (|4.6I) to approach 
one as s 0. 


Remark 4.4. Since \ V n \ is non-negative, a series expansion, together with lower bounds on moments 
of \V n \, can also lower-bound its moment generating function. Indeed, by (|4.4j) . for n large enough 
(independent of r), 


E 


OO 

(e a ' v "l) = 1 + ^EdKH 


r =1 


> i + 


2y/2 


£0£ 
y/2 


r\ 


^ (r + 1)! 

r =1 


= l - 


2y[2 


+ 


beos 




/\/2 


- 1 


(4.7) 


This gives a lower bound similar to (14.61) . except for the additive constant 1 — 2y[2/b, which ensures 
that the right-hand side of (|4.71) tends to one as s | 0. 

Note finally that (14.61) is computed using the lower bound in (14.31) with 4>(x) = e tx and t = s/y/n: 

te 0 \j\ \ ' _ 2y/2n + 1 £q / s 

2 ) by/n “ 2 r ~ 1 by/nr\ \-^/n 




Above, the constant term (i.e., r = 0) converges to 2y[2/b as n ^ oo, which is different from the 
constant term (equal to 1) in the series expansion in (14.71) . The lower bound on the last series above 
can be computed using (14.41) : 


E 


2 r ~ 1 by/nr\ 



Al/sV 2( 3-r )/ 2 £Q r 

~ ^ rl VvW b{r + 1) n 


2y[2 4 

b be^s 


(e £ ° s/ ' /2 - l) . 


Therefore, the difference in the constants in (14.61) and (14.71) stems from the different constant terms 
(r = 0) in the respective series expansions. 
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4.1.2 The Extended U n 


The bounds presented in the previous subsection give the correct order for the centered absolute 
moments (given that p is small enough), i.e., E(|L n — p n \ r ) ~ 0(n r / 2 ). On the other hand, as 
argued at the end of Example 14.31 the lower bound on the moment generating function of \ V n \ can 
be improved. A way to do so (also valid for the moments of \V n \) is to extend the set U n and to 
refine the lower bound (|4.2|) . In the present subsection, we show how the refined lower bound yields 
better approximations. 

In what follows, take f3 € (1/2,2/3), and redefine U n as follows 


U n 


2 nq — (2n)P, 2 nq + (2n) /3 


ns n 


|k £ Z : |k — 2nq\ < (2n)^| . 


(4.8) 


In the sequel, this U n is called the “extended” U n , and note that again ko = 1 (recall (j3.3p ). Since 
(2 n)^ = o(n 2//3 ), it follows from the local de Moivre-Laplace theorem (cf. (271 pp. 56]) that 


sup 

k : \k—2nq\<(2n)P 


'( u n = k ) 


where 


4> n (k) : = 


: exp 


»(*) 


(k — 2 nqY 


- 1 


0 , 


and q = 1 — p. 


2y/irpqn l 4 npq j 

Hence, for every e > 0, there exists N € N such that, for any n > N, 

(1 + e) 4> n (k) > P (U n = k) > (1 — e) (j> n (k ), for all k € Z with |k — 2 nq\ < (2n) /3 . 


(4.9) 


In particular, 

p(t/ "= fc)a ^l exp (- e C :i ) =:,,(n) ' (4 - io) 

Since 2/3 — 1 < 1, it follows that 

y- l (n) e~ cin —> 0, whenever c\ > 0. 

On the other hand, by Theorem 14.11 there exists £o > 0 and c\ > 0, such that A„(eo) < e~ cin , 
for any n € N large enough. Thus, the condition (13.1011 holds and therefore, all the assumptions of 
Theorem 13.41 are verified. 

Thus, recalling that 5 = Eq/2 and a n € [2 nq — (2n)^,2nq + (2n)^], 

E($(|L n (Z n )-/x n |))> Y, $(S\k-a n \)F(U n = k)>(l-e) ^ 4> (5\k - a n |) (/> n (k), 

k£U n k:\k-2nq\<{2n)P 

for n large enough, where the inequality follows from (14.91) . From the symmetry of 4>(5| • |), the 
right-hand side above is minimized at a n = 2nq. Moreover, let 

jk(n ) := ——, for k £ Z with |k — 2 nq\ < (2n) /3 . 

V 2 n 
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Clearly, ( jk( n ))keu n forms a l/\/2n-net over the interval [—(2n)P 1 / 2 ,(2 n)P 1//2 ]. Hence, 


E($(|L n (Z n )-ju n |))> (1-e) ^ $ (<S |fc - 2nq\) <j> n {k) 

k : |fc—2ng|<(2n)^ 


= (i-0 


E 


-- $(5V2n\jk(n)\) 

nvan V / 


2y/irpqn 


exp 


k: \k—2nq\<(2n)P 

Example 4.5. For 4>(x) = \x \ r , r > 1, since tL 3-1 / 2 —>• oo, asn^ oo, 


Uk{n)Y 

2pq 


(4.11) 


lim —= 

MOO yJ2Tipq 


E 


1 


\/2n 


\jk( n )\ r exp - 


/c: |fc—2nq , |<(2n)^ 

Since e > 0 is arbitrary, (|4. 11[) leads to 

E (|F n (Z n ) — p n | r ) 


(jfcfo))' 

2pq 


1 


V 27 W J-c 


I r e -x 2 /(2pq) dX ' 


liminf E (|V^| r ) = lim inf ■ 

n—>oo n- ' — 


n 


r/2 


> 2 r /VE(|£| r ) = ^=(pg) r/2 r 

\/vr 


r + 1 


= : d2(r)> 


where £ ~ J\f(0,pq). Therefore, for n € N large enough, 

E(|L n (Z n ) -/inD > ^=(pq) r/2 T 


r + 1 


n 


r/2 


Let us compare this last bound with (14.4|) of Example 14.21 From (14.91) . 


- < — -— e 1 /l 2pi? ) implies that d\{r) < -———— - 
b 2^/irpq (r + l) y J'npq 


0_ e -i/(2 pq)' 


Hence, for each r > 1, d, 2 (r) > d\{r) will follow from 


(2 pq) 2 r 


r+i ( r + 1 


> 


2 ) r + 1 

To show (I4.12|) . set 6 := (r + l)/2 and 


e 1 /( 2p,? \ for p € (0,1) and = 1 — p. 


(4.12) 


fix) := x y e x Tid + 1), x := - > 2. 

y J y h 2pq~ 

Then (14.121) is equivalent to fix) > 1, for all x > 2. Since 

f'(x) = T(6 + 1) (^-ex- e ~ 1 e x + x~ e e x ^j = x~ e e x T{0 + 1) ^1 - 

has a unique zeros at x = 9, it is sufficient to show that 

f{0) = e~ 6 e 6 Tie + 1) = 0 1_ V T(0) > 1, for any 0>1. 

But, this follows from a classical inequality for the gamma function (cf. jTS 1 , Theorem 1]) 

T(x) af'-V 


r (y) p 2 ' 1 e x 


, for any x > y > 1. 


Therefore, d^ir) > di(r), for any r > 1. That is, with the extended Theorem 13.41 provides larger 


constants than the ones obtained with the standard U n . 
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Example 4.6. For & n (x) = & sx l'fi™ with s > 0, then 


Y 4> n (5\k - 2 nq\) (j> n (k) = 


k: \k—2nq\<(2n)P 


Hence, (14.1111 leads to 


\J2irpq y/2n 


E 


exp s5\j k (n)\ - 


k: \k—2nq\<{2n)P 


Uk(n)Y 

2 pq 


1 r (x M ^ 

V^PQJ-oc V 2 pq) 


n —» oo. 


(4.13) 


lhng;f E (e x p( J L "< Z J_- > E (e^'), 

where again £ ~ J\f(0,pq). Therefore, for any s > 0, 

liminf E (e a|Vn| ) > E ^o|«|/^ ( 4 . 14 ) 

Unlike the lower bound obtained in (14.61) . the present lower bound approaches one as s j. 0. Thus, 
again, the bound obtained via Theorem 13.41 with the extended tl n , outperforms the one obtained 
with the standard U n . 

Remark 4.7. Since cfo is given by the r-th moment of |£|, clearly 


OO 

1 + Y d2 ^~l = E (e aeo| * l/v ^) 


r =1 

and so in this case, the lower bound on the moment generating function obtained via the series 
expansion gives the same result as (14.1411 . 

Remark 4.8. There is another way to get (14.13|1 . By (14.91) . it is enough to prove that, as n -+ oo, 
Y <S> n {6\k-2nq\)F(U n = k) = Y exp j -jL \k - 2nq\ j P (U n = k) -> E^e sS ^ 

k: \k—2nq\<(2n)P k:\k-2nq\<(2n)P ^ 

To see this, note that by the CLT and the continuous mapping theorem, for any sGM, 


eXP (^ln \ Un ~ 2nq \) 


n —> oo, 


(4.15) 


D 


where “ —> ” stands for convergence in law. Moreover, for any e > 0, by Hoeffding’s exponential 
inequality, 


E ^exp T=d | U n — 2nq^j ^ = 1 + J P ^exp ^s<5(l + e)y/2n 


U n - 2nq 


2 n 


> x ) dx 


2n _ q 


( U, 

1 + l " 

poo 

<1 + 2 / exp ( —4n 


■I 

i 


Inx \ , 

-- I (1 nr 

~ s5{l + e)V^J 

(In x) 2 \ , 

1 dx 


= 1 + 2 exp — 


2s 2 5 2 (l + e) 2 n 
2 y 2 


s 2 5 2 ( 1 + e) 2 


+ y ) dy < oo, 
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which implies that the family of non-negative random variables 

( s5 


exp - 


| U n — 2nq\ : n € N 


\y/2n 

is uniformly integrable. Let A n := {U n 0 U n }. Again, Hoeffding’s exponential inequality leads to 
P (A n ) = P (\U n — 2nq\ > (2n) /3 ^ < 2exp 2(2n) 2/3_1 ^ 0, n —>• oo, (4-16) 

since 2/3 > 1. Therefore, for any s € M, 


lim Y exp ( |k — 2nq\ ) P (U n = k ) 

n— >■ o° z J \ x/2n 

k : \k—2nq\<(2n)P 


= lim E 1 A c exp 


( sS 


| U n — 2nq\ = lim 


\y/2n 

where the last equality follows from (14.151) and the uniform integrability. 


E ^exp I U n - 2nq\ S j ^ = E (e sS ^ 


4.2 Uniform Approximation 

We next show that in the binary LCS case, the uniform approximation approach (Theorem 13.61) 
also applies provided that U n is standard. Indeed, with the standard U n , we have m(n ) = 2y/2n 
(ignoring the rounding), and (f(n) = 1 /(by/n), so that the condition m{n) > c<^ _1 (n) = bcy/n holds 
with c = 2y/2/b, and thus Theorem 13.61 applies. Therefore, 


Zip 1 (n) 


E($(|L n (Z n )-// n |))> y 4> 


3 = 1 


£o 


2 \8 <p(ri) 


° +j)W(n) + V £ ° C 


8 ^16^(n) J 4 \16^>(n) 


(4.17) 


Again, let us apply this result to |L n (Z n ) — n n \. 

Example 4.9. For <L(x) = |x| r , r > 1, the rightmost bound in (|4.17j) gives 

er 




0 n r '\ 


2 (5r+l)/2 b 


Note that 


3 —r 


d,(r) = d 3 (r,p, So ) := 2(5r+1)/2 = *M. 

where d\(r) is the constant given in (14.41) . This last fact is quite expected since we had a cruder 
approximation and a weaker assumption A n (eo) —> 0, as n —» oo. 

Next, applying the finer lower middle bound in (I4.17P gives 


1^ 2 ( n ) , 

E *(? 


3 = 1 


\ 2 \8 <p(n) 


/ \ i ^/(2V2) , n 


+ 3 + 


> 


b2 r y/n 


L 


\fn/ ( 2 \/ 2 ) / 


/_vm 

V2V2 


+x dx + 


e r 0 n r ! 2 

2 (5r+3)/2 5 

eC n r / 2 


2 (5r+3)/2 


2 (5r+3)/2fr ( ' 


/2 r+1 - 1 


r + 1 


+ 1 ) n r//2 . 
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Once more, a better approximation leads to a larger constant 


di{r) = d 4 (r,p,e 0 ) := b l * £ r 0 2 2 


sr+3 / 2 r+1 - 1 


r + 1 


+ 1 = 


2 r+1 + r 

2(r+ 1) 


dz{r) < di(r). 


Example 4.10. For 4>(x) = e tx with t > 0, the rightmost bound in (14.171) gives 


E ([_ <|) ( ~ 11 ‘ ) — jl £ot\/n/(4\/2) 

V / 4 \16</?(n) / V"2 6 


V16 pin)) spib 

while the more refined middle bound, with pt = e £ ° t//2 , gives 


(4.18) 


£*(? 

j=i 


\ 2 \8p(n) 


+3 <p(n) + 


8 \16<^(n) 


e 0 c 


O e 0 ty/E/(4V2) , 

E 4 + 0 

1=1 


6 \/n 

Pt 


2\/2 b 


Pi 


VF/( 2\/2) 


Pt 


vW\/2 J^/{2y/2)\ Pt 


V^/( 2V2) 


~Pt 




+ 


2y/2b 


by/n{pt- 1 ) 

For every t > 0, the convergence to infinity (as n —>• 00 ) of this last bound is slower than that of 
the bound (14.51) . Hence, again, the uniform bound is smaller (for large n) than the ones obtained 
in the previous subsection using Theorem 13.41 Since t > 0, for n large enough (depending on t), 


E e 


\Ln(Z n ) — tJ.n\t\ > ^ Ent N /n/(4\/2) Pt 


\V^+1 E0 tyfii/(4y/2) 


> b e ~ 


+ 


> I e 0 ty/n/A 1 
“6 


a eoty/n/ (4%/2) 


Pt - 1 ' 2>/2 b ~ b 2y/2 b 

Remark 4.11. Again, when estimating the moment generating function of \V n \ = |L n (Z) — p n \/y/n 
via its series expansion, with the lower bound E(|F) l | r ) > d^(r), for large n (uniformly in r), 


1 °° 1 / 

<^') £1 + - E E i -5 


V2b^r\\4y/2 S ) y/2 b 


o £0S 


/(4V2) 


+ 1 - 


1 


s/2b' 


which is the same bound as in (14.181) with t = s/y/n, except for the constant term. 

Remark 4.12. With the extended ld n , the uniform approximation fTheorem l3.6l) cannot be applied, 
since there does not exist a constant c > 0 such that m(n) = 2(2 n)P > cp~ l {n), for any n € N, 
with ip(n) given in (14.101) . 


Remark 4.13. We have obtained several constants d^(r) < d±{r) < d\{r) < d 2 (r) involved in lower 
bounding the moment E(|F ra | r ). The best constant, d 2 (r), is obtained under fast convergence with 
extended U n . Clearly, in order to deduce the existence of c(r) such that E(|14| r ) > c(r), it suffices 
to tackle the case r = 1. However, it is easy to verify that in all the cases considered above (i.e., 
c(r) being either di(r), d 2 {r), ds(r ) or 0(4 (r)), the constant (c(l)) r is smaller than c(r). Moreover, 
in all these cases, it is also true that c(r + 1 ) > (c(r))^ r+1 ^ r , so that a better constant is obtained 
when estimating the r-th moment directly rather than estimating a lower-order moment. 


5 An Upper Bound on the Rate Function 

5.1 Background and Preliminary Results 

The analysis right below (before Proposition 15.11) is similar to [H Theorem 2], but with a more 
general notion of score function and a slightly different definition of gap@. Again, let (X n ) nG ^ and 

1 In [2, the scoring function takes the value 1 for matches and the penalty —p for mismatches. Moreover, the gap 

in [4] is an indel in one sequence, and the gap price —S is assumed to be negative. 
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O'rOneN be two independent sequences of i.i.d. random variables, and let us consider a general 
scoring function S : A x A —> M + . By subadditivity, 


T >» T T ('srn+m.~ v -n+m\ 

^n+m E ^ f-'Si+l > Y n +1 J > 

where L(x;[+;»; Y”+I>) denotes the optimal alignments score of X”+™ := (X n+ i,... ,X n+m ) and 
:= (Y„+i,... , Y n+m ). Hence, for any s > 0, 

P (L n+m > s(n + m)) > P (L n > sn, L (X£+™; Y™+?) > sm) = P (L n > sn) P (L m > sm). 


Thus, by Fekete’s lemma, for any s > 0, the following limit - the rate function - exists: 


1 1 

r(s) := lim-InP (L n > sn) = inf-InP (L n > sn) < oo. 

iho o n neN n 


and so, 


Since for any n € N, 


P (L n > sn) < e r ( s ) n 5 for any s > 0 , n € N. 


E (L n ) < 7 *n, where 7 * := lim —-——. 

71—> OO Ti 


it follows from ( 12 . 21 ) that for any s > 7 *, 


P(L n > sn) = F[L n -K(L n )> I s - ^ Lri> ) n ) < p (L n -E(L n ) > (s-j*) n) < expi¬ 


re 


(s - 7*) 2 re' 

If 2 r 


Therefore, for s > 7 * 


Moreover, for 5 = 7 * 


r(s) > 


( s - 1*) 2 

K 2 ' 


(5.1) 


P (L n > 7 *n) = P L n - E(L n ) > 7 * - 


E(L n ) 


n 


n 


n ) < exp ( 7 - 


E (L r 


re 


The aim of the present section is to show that the methodology developed to date allows us to 
partially reverse (|5.1D in the LCS case. That is, we will show that there exists a universal constant 
B > 0, such that 


r(s) < B(s- 7 *) 2 , 


(5.2) 


for any s belonging to an upper neighborhood of 7 *. Moreover, the full claim (15.21) . i.e., for all 
s > 7 *, is shown to hold under a uniform convergence assumption, which is not proved in general. 

Hereafter and throughout this section, we will only consider the binary LCS case. For the rest 
of this subsection, we obtain a lower bound on E(e i ( Im( ' ZT, ^ n ( 2 ™j))) ( no t e that there is no absolute 
value in the exponent), which will be used to prove the claim (|5.2|> in the neighborhood of 7 *. Since 
we are considering the special case of LCS and since we have already seen the advantage of the 
extended U n over the standard one, we shall use the extended U n . Throughout this section, let 
Po > 0 be given as in Theorem 14.11 
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Proposition 5.1. Let t > 0 and let p € (0,po)- Then, there exists £g > 0, such that 
Inn inf E ^exp {L n {Z n ) - £ n {2nq))^j > 


where 


A = A(£n) := min 1 — 
t >o ' 


pq £ot 


< £ < pqt e (o, l), 


(5.3) 


and where £ J\f(0,pq). 

Proof. For simplicity, assume 2nq to be an integer. Let U n be given by (14.81) . By the conditional 
Jensen’s inequality, 

E^gh^ ,n ( Zrl ) — ^ £n(2nq))^j s, jg Li(2ng))^ ^ ^ ( gt(^"(k) — Li(2ng))jp ^ 

k&An 

2nq+(2n)P 2nq—l 

= J2 e t ^ k) - e ^ 2nq)) F{U n = k)+ Y e t ^ k) ~ e ^ 2nq)) F(U n = k ). (5.4) 

k=2nq k=2nq—(2n)P 

Theorem IXT1 ensures that, with £o := £i — £ 2 , the condition (13.101) holds and therefore, all the 
assumptions of Theorem 13.41 are satisfied. If k G U n and k > 2 nq, then by Theorem 13.41 £ n (k) — 
£ n (2nq) > 5{k—2nq), where 5 = £o/2 < 1; while if k < 2nq, then by (14.11) . £ n (k) — £ n {2nq) > k — 2nq. 
Hence, by (14.91) . for any e > 0 and n (depending on e, but not on t) large enough, 

2nq+(2ri)P / ^ N 2nq+(2n)P 

Y exp 

k=2nq 

and 


t 


\f2n 


(£ n (k) -£ n (2nq))^j F(U n = k) > (1 - e) Y exp (^-^= (k - 2nq)^ 4> n (k), 


k=2nq 


2nq-\-(2n)P 


Y exp (^ 7 ^ ( k - 2n ^)) &*( k ) -> ^1 + erf e OT£ o i2/8 . 

k — 2nq 

Similarly, for n (depending on e, but not on t) large enough, 

2nq —1 , v 2nq —1 

S exp 

k=2nq—(2n)P v v 7 k=2nq—(2n)P 

and 


/ \ 2nq— 1 , v 

( Vln ~~ ^( 2n ^)) J F ( Un = k )>( 1 ~ e ) Y exp \ ~J2n ^ ~~ 2n? 7 ^ n ^)> 


2nq — 1 

S exp 

k=2nq—(2n)P 


. \/2n 

Since e > 0 is arbitrarily chosen, 


{k - 2nq)^j <f> n (k) -> ^1 - erf e pqt2/2 . 


lim inf E 

n—>oo 


(L n (Z n ) - £ n {2nq)) 


> 


> 


( exp (; 


e pqelt 2 /S 


= ( l-F(pqdt<^<pqt))e pq£ o t2 / 8 , 


where £ iV(0,M)- 


□ 
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To conclude this subsection, we provide an estimate on |E(L n (Z„)) — l n (2nq)\/\/2n, which will 
be useful in the sequel. 

Corollary 5.2. In the binary LCS setting, with p € (0,po), 


In particular, 


|E(L n (Z n )) -I n (2nq)\ 2 pq 

limsup- -= - < \ -. 

n —>oo \/2n V vr 


lim = lim = 7 *. 

n—»■ oo Tl n—>■ oo Tl 


Proof. By (14.ip . 

Kn(^) - 4 ( 2 ng)| < |/c - 2 ng| , k € W n . 
Hence, for any e > 0 and n large enough, 


E ~f: ( 2 " 9 > ' F ( u n = tj < a+«) E 

kGUn *^ k&Un ^ 2U 


Again, 


and by (14.161) . 


E 

k^ilAn 


y/2n 

| k — 2nq\ 
y/2n 


i(k) -)• J 


\x\ 


e = \l-^’ 


EMh4M p([fii = l)$ » 


k0A n 

Thus, 


\/ 2 n 


\/ 2 n 


-oo 

(U n 0 77 n ) < \/2nexp — 2(2n ) 2/3_1 


(5.5) 


0 , n —>• oo. 


r |E(L n (Z n )) -4(2ng)| E(|4(t/ n ) -£ n (2nq)\) 2pq 

lim sup- j= - < hm sup-=- < 


\/2n 


y/2n 


7r 


which clearly implies (15.51) . 


□ 


5.2 An Upper Bound on the Rate Function in the Neighborhood of 7* 

The goal of this subsection is to prove the claim (15.2|) in the neighborhood of 7 *, i.e., for some 
constant B > 0 and 7 > 0, 

r(s) < B (s — 7*) 2 , for any s € ( 7 *, 7 * + 7 ]. (5-6) 


Let 

A n (i) := logE(V L ™ (Z ’ l) ) , t G K. 

The following result of HU (see also Ha Theorem 1]) will be useful in the sequel of the proof: there 
exists A(f), t£l, such that 

lim — — = A(f), for any t € M. 

n—>-oo Tl 

Moreover, r(s) and A(t) are convex functions and are related via 

r(s) = A*(s) := sup (ts — A(f)), A (t) = r*(t) = sup (ts — r(s )), for all s € M and t > 0. (5.7) 
t >0 sGK 
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5.2.1 A Global Upper Bound Under a Uniform Lower Bound 

Recall from Proposition 15.11 that 


lirninf E 

n —>oo 


t 


exp 


\y/2n 


(L n (Z) - 4(2 nq)) 


> A e pq£ o t2/s . 


where A is given by (15..ID . Hence, for every 0 < Ai < A and t > 0, 

t 


E 


exp 


y/2n 


(L n ( Z n ) - 4(2 nq)) 


> Aie OT£ o* /8 , 


(5.8) 


provided that n (which depends on t) is large enough . 


Proposition 5.3. Let (15.8D hold uniformly over [0, oo), i.e., let there exists no € N (independent of 
t) such that, whenever n > no, (15.8D holds true for every t > 0, then (15.2D holds true for all s > 7*. 

Proof. Let A 2 := pqe q/8. Since (15.81) holds uniformly in t, for n > no, 

An f-L") > +lnA! + A 2 t\ for any t > 0. 

V\/2 nj _ y/2n 

With u = tj\pln. , for n > no, 


K(u) £ n {2nq)u InAi 2 

- >-1-b 2 A 2 u , for any u > 0, 

n n n 


and thus (recalling (|5.5|) f . 


A(n) = lim L ^ > j*u + 2 A 2 u 2 , for any u > 0. 


n—¥ 00 ft 


Now, for every s > 7 *, u > 0, 


su — A (it) < (s — 7 *) u — 2 X 2 u 2 , 


which implies that, 


fg _y*)^ 

r(s) = sup (su — A(u)) < sup [(s — 7 *) u — 2 A 2 U 2 ] =-—-. 

u >0 u >0 8A2 


Therefore, (15.2D holds true, for all s > 7 *, with B = 1 /( 8 A 2 ) = 1/(4 pq5 2 ). 


□ 


5.2.2 Binomial Approximation With the Further Extended U n 

In this part, we derive (15.6D without assuming that (15.8j> holds uniformly in t. Let M q (t) and 
K q (t ) be respectively the moment generating function and the cumulant generating function of the 
Rademacher law with parameter q, i.e., 

M q (t) := e^^q + e~ tq (l - q), K q (t ) := In M q (t) = In ( e t{l ~ q) q + e- t9 (l - qfj . 

We start with the following general theorem. 
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Theorem 5.4. Let p £ (0,po)- Let there exist t 0 > 0 and 5 > 0 such that, for any t € [0,fo]? 
whenever n is large enough (possibly depending on t), 

£ f e t( L n(Z n )-£n(2nq))\ > v n M% n (St), 


(5.9) 

where (i'n)neN C [0, oo) is such that lnz/ n = o(n), as n —>■ oo. Then, there exist constants B > 0 
and y > 0 such that (ESI) holds true. 

Proof. Let t € [0, to]- Taking logarithms in (|5.9I) leads to 

A n (t) = InE > t£ n (2nq ) + 2nK q (5t). 

Dividing both sides of the above equality by n, and using (15.51) . lead to 

A(t) > fy* + 2 K q (dt). 

Now for any s € R, 

ts — A (t) < ts — iy* — 2 K q (5t) = t(s — y*) — 2 I\ q (5t), 

which implies that 


(5.10) 


sup (ts — Aft)) < sup [f (s — y*) — 2i£g(<5f)]. 

te[o,i 0 ) te[o,t 0 ) 


(5.11) 


We next show that A'(0+) = y* (recall that Aft) and r(s) are related to each other only for 
t € [0, oo), see (|5.7D ). First, by (I5.10p . since K q (5t) > 0 for any t > 0, we have 


Aft) > y *t, for any f > 0. 


(5.12) 


Next, by definition, r(s) = 0 for any s < y*. As stated in m Theorem 1], r is a convex and, 
therefore, continuous function, and so r(y*) = 0. Thus, the function r is equal to zero up to y* and 
is strictly increasing afterwards, and so, 

A(t) = sup (ts — r(s )) = sup (ts — r(s)). 

s> 7 * 


Using (15.11) with K = 1 in the LCS case, for any t > 0, 

A(t) = sup (ts — r(s)) < sup ts — (s — y*) 2 = tst — (st — y*) 2 = ty* + -t 2 , (5.13) 

s> 7* s> 7* -* 4 

where s t := t/2 + y* > y *, t > 0. Combining (15.121) and ()5.131) . it follows that A'(0+) = y*, which, 
together with (15.111) . implies that there exists yi > 0, such that for any s € (y*, y* + yq], 

r(s) = sup (ts — Aft)) = sup (ts — Aft)) < sup [t (s — y*) — 2K q (5t)] . 
t> o te[o,t 0 ] te[o,t 0 ] 


Now, 


sup [t (s - y*) - 2K q (5t)} 
[o,to] 


2 sup 

te[oy 0 ] 


St ( q + 


s — y 
25 


= 2 sup 
w£[0,(5to] 


U 



26 ) 


- In ( qe tS + (1 - q)j 

- In (qe u + (1 - q)) 
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With 


x := q + 


s — 7 
2<5 ’ 


we obtain that the solution of 


x = 


qe 


is 


qe u + (1 - q) 
x(l — q) 


u{x) = In . 

q{ 1 — x) 

Clearly, there exists 72 > 0, such that when s — 7 * < 72 , u(x) G [0, <5to], and in this case 

sk 

S — 7 


sup 
uS[0,(5to) . 


u \ q + S ^ ) - hi {qe u + (1 - q)) 


= sup 

u>0 


u ( q + 2§ ) - In (qe u + (1 - q)) 


= u(x)x — In (qe u ^ + (1 — q)^j 

, x 1 — x 

= x in —b (1 — x) in- 

q 1 - q 

= ■ D{x\\q). 

Since for w > 0, ln(l + w) < w, and since for w € (0,1), ln(l — w) < —w, it follows that 

1 


D ( x \\q) X )^l = {x ~ q)2 = _ 

q [ } l-q q(l-q) 4S*q{l - q) 

Therefore, (1 5.6 jl holds with B = 1 /(4<5 2 g(l — q)) and 7 = min( 71 , 72 ). 


(« - 7*) ■ 


□ 


Remark 5.5. By examining the above proof, it is easy to see that if (15.91) holds for any t > 0, then 
m holds for any s > 7 *. 

Let us return to (15.91) and examine under which conditions the hypothesis of Theorem 15.41 are 
satisfied. Since U n B(2n,q), 

2 n 


e tS(k-2nq) F = k ) = Mq n (5t), 


k =0 


one might try to use the first term in ((5.41) to get ((5.91) : 

IE Li(2nq))^ n, £n(2n5))p 


Thus (15.91) holds if, 


fceWn, k>2nq 


2nq+(2n)P 

J2 e tS ( k ~ 2nq) F ( U n = k) >u n M 2n (5t), 

k=2nq 
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for a sequence (i'n)neN of positive reals such that In u n = o(n) (as n —>• oo). Unfortunately, there 
does not exist such (u n ) n& n, since 

/ 2nq+(2n)P \ 

- In e t5{ - k ~ 2nq) F (U n = k) 1 < - In (exp (t<5(2n) /3 ) ) 0, 

\ k=2nq I 


n —> OO. 


To show a similar requirement, we need to further enlarge U n . The following lemma shows how to 
do so. 

Lemma 5.6. For any g > 0, there exists b := b(g ) > 0 such that for every n € N large enough, 


Proof. Since U n ~ B{2n,q), 


inf F{U n = k) > e _T?n 

k : \k—2nq\<bn 


2n \ k, 


P (U n = k)=[ ) q*( 1 - q) 2n ~ k , k = 0 ,..., 2n. 


Next (cf. [9j Example 12.1.3]), 


2 n\ 1 / , / k 

> -exp 2 nh e \ — 

k ~ 2n +l V \2n 


, k = 0,..., 2 n, n € N, 


where h e is the binary entropy function of base e: 

h e {q) = -q\nq- (1 - q) ln(l - q), q € ( 0 , 1 ). 

Hence, 


'{U n = k)> 


1 


2 n + 1 


exp 2 n 


hr 


k 


+ In q + I 1 — 


2 n / 2 n 


A 

2 n 


ln(l - q) 


The continuous function g(x) = h e (x) + xlng + (1 — x) ln(l — q), x € (0,1), is such that g(x) < 0, 
for any x € (0,1), with g(q) = 0. Thus, for any r/ > 0, there exists b{rj) > 0 such that, whenever 
\x — q\ < 6/2, then g(x) > —g/A, and so, if | k — 2nq\ < b(g)n, 

k \ k , / k \ 1 /i ^ rj 


K ^) + *l lnq+ V-2^ ln(1 - ql> -T 


Therefore, for n (depending only on g) large enough, 


inf P (U n = k) > - 

k:\k-2nq\<bn 2 Tl + 1 


exp 


k 

2n h P I — 
2 n 


q k ( 1 - q) 2n ~ k > —— exp (~-n)> e~ qn . 
2n ~ t - 1 V 2 


The proof is now complete. 

Using Lemma 15.61 we can now verify the condition (15.911 of Theorem 15.41 


□ 


Theorem 5.7. Let p £ (0,po). Let eo := £i — £2 > 0, where £1 > 0 and £2 > 0 are given in Theorem 
4-l\ and let 5 := £ o/2. Let to > 0 be the unique positive solution to 


25t - b 2 - 2 In M q (St) = 0 . 


(5.14) 


Then, for any e > 0 and any t € [0, to]; there exists N(t ) € N such that for any n > N(t), (15.91) 
holds true with v n = 1 — e, n € N. 
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Proof. By Lemma 15.61 the set U n can now be further enlarged. Indeed, taking 0 < rj < c± (where 
ci > 0 is given in Theorem Id. ip . there exists b := b{rf) > 0 such that 


0 —cin 


sup 

k: \k—2nq\<bn ™ Wn = «) 


< e (v~ci)n _> 


n —>• oo. 


Let us therefore take 


U n = [2nq — bn, 2 nq + bn] O S n 


With tp(n) = e~ vn and by Theorem 14.11 the inequalities (13.1011 (with £o = £\ — £ 2 ) and (13.1111 
are both satisfied. Since in our LCS case (I3.8|l also holds, all the assumptions of Theorem 13.41 are 
verified and thus, for any k £ U n , £n(k + 1) — £ n (k) > 5 = £q/2, provided that n is large enough. 
Therefore, by the conditional Jensen’s inequality, 


2nq-\-bn 

E (2?1C?)) ^ ^ ^ gt(£n{k) — £n( 2ng))jp 

k=2nq 


2nq-\~bn 

> J2 e «(fc-2nq)p (u n = k )=: Sg\t). 

k=2nq 


(5.15) 


On the other hand, 


S n (t) := M q {5tf n = E f e St{Un-2ng)\ = \ e tS{i~9) q + e -««( 1 - g ) 


1 2n 


2nq—bn— 1 2nq— 1 

= ^ e 5i(fc-2n ? ) p (y n = fc ) + ^ e 5t(fc-2nq) p (^ = fc ) 

/c=0 k—2nq—bn 

2 nq+bn 2 n 

+ J2 e St{k ~ 2nci) ¥ (U n = k) + Y1 e St{k ~ 2ng) F (U n = k) 

k=2nq k=2nq-\-bn-\-l 

=-.S^\t) + S^{t) + S^(t) + S^{t). 


It is easy to see that for every t > 0, 


2nq — 1 

S {2) (t)< J2 e 

k=2nq—bn 

Moreover, for any t > 0, 


nb 


w _ p —t8 _ —t8(nb) 

t8{k—2nq) _ \ A p —t8j _ ^ ^ 


= E e ~' = 

3 =1 


1 - e~ ts 


M _ i 


= : c(f), n —>• oo. 


( 2nq—bn—l 2n \ 

| e t5(fc_2n9) P(t/ ri = A:) < e 2l5tn p(|[/ n _2ng| >nb) < 2 e ( 25t_f,2 ) n , 

/c =0 fc= 2 ng+ 6 n+l y 

where the last inequality follows from Hoeffding’s exponential inequality. Thus, for t > 0, 

i_ s cw + +2exp((24t __ 2UMq{U))n) 

un\b) ^n\b) ^n\b) 
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Hence, for any 0 < t < to, where to is the solution to (15.141) . 

S ( n\t) 

W ’ asn ^”' 

Together with (15.151) . it follows that, for every t €E (0,fo], there exists N(t) € N, such that for any 
n> N(t), 

E nq))^ > Sn( fl > (]_ _ e )M 9 (<5t) 2n , 

which completes the proof. □ 

6 Beyond the Binary LCS Case: Discussion 

In this paper we applied the general methodology developed in Section[3]to the binary LCS case with 
strongly asymmetric distributions. From Theorem 14. 11 this setup is one of few known models where 
the existence of a suitable random transformation and of £o (such that (13.81) and A n (eo) —>• 0 both 
hold) have been proven to exist. In the binary LCS case, Theorem 14.11 assumes a very asymmetric 
distribution, and further research is needed to relax this asymmetry assumption. 

Other setups where the assumptions of Theorem 13.41 have been proved to hold were studied 
in m m and [22] • For example, in [21] and [22], the random variables (X n ) ne N and (T n ) n eN 
are all i.i.d., every letter /3 € A is taken with a positive probability (i.e., P(Xi = (3) >0), and the 
scoring function 5 as well as the distribution of X\ do satisfy the following asymmetry assumption. 

Assumption 6.1. There exist a,b € A such that 

Y P ( X i = P) ( 5 (M) - > °- (6- 1 ) 

/3&A 

For the binary alphabet A = {a, 6}, condition (jfi.ll) reads 

(5(6, a) - 5(o, a)) P (X 1 = a) + (5(6, b) - 5(6, a)) P (X x = b) > 0. 

Since S is symmetric and one could exchange a and b, the condition ([6. ID actually becomes 

(5(6, a) - 5(a, a)) P {X x = a) + (5(6,6) - 5(6, a)) P (X 1 = 6) + 0. 

When 5(6, 6) = 5(a, a) > 5(6, a) (recall that 5 is assumed to be symmetric and non-constant), then 
Assumption 16.II is satisfied if and only if P(Xi = a) A P(Xi = 6). Thus, in terms of the distribution 
of Xj and of the scoring function 5, (16. ID requires much less than Theorem 14. II However, the price 
to be paid there is in terms of 5. Namely, the analogue of Theorem 14.11 can be shown to hold only 
if the gap penalty —5 is sufficiently high. The theorem itself is as follows (cf. |21l Theorem 3.1]): 

Theorem 6.2. Let Assumvtion \6.1\ hold. Let a,b £ A be as in m, and, let the random transfor¬ 
mation 1Z turn the letter a in Z n into the letter 6. Then there exist constants 5o < 0, eo > 0, a > 0, 
and no S N, such that for any 5 < 6 o and n > no, 

¥(E(L n (TZ(Z n ))-L n (Z n )\Z n )>eo) >l-e~ an . 
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Remark 6.3. Typically <5 q < 0, so that the condition 5 < 6 o indicates that the gap penalty —6 has 
to be sufficiently large. Hence, the result does not apply to the LCS case. Intuitively, the larger the 
gap penalty (the smaller the gap price), the fewer gaps appear in the optimal alignment, so that 
the optimal alignment is closer to the pairwise comparison (Hamming distance). Some methods for 
determining a sufficient <5o, as well as some examples, are discussed in [22]. We believe that the 
assumption on 5 can be relaxed so that Theorem 16.21 holds under more general assumptions. 

If the assumptions of Theorem 16.21 hold (i.e., the scoring function S and the law of X\ satisfy 
Assumption 16.II and 5 is sufficiently small) and the alphabet A = {a, b} is binary, then the situation 
is exactly as in the binary LCS case considered in Section 0] except that A (see (|3.9lP might not 
be 1. However, the constant A is not essential in any of the proofs, so all the results of Sections 
0] and [5] continue to hold (with appropriate changes stemming from A). If |M| > 2, then to apply 
Theorem 16.21 one should use a more general version of Theorem 13.41 see [2H Theorem 2.2]. The 
same theorem should apply to the LCS case when the alphabet is not binary, but the model is still 
strongly asymmetric, i.e., one letter has to have a very large probability. 
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